Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 331267 |
| Missing cells | 2210825 |
| Missing cells (%) | 35.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 48.0 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Numeric | 13 |
| Boolean | 1 |
MRG has constant value "False" | Constant |
ARPU_SEGMENT is highly overall correlated with FREQUENCE and 7 other fields | High correlation |
CHURN is highly overall correlated with REGULARITY | High correlation |
FREQUENCE is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
FREQUENCE_RECH is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
FREQ_TOP_PACK is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
MONTANT is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
ON_NET is highly overall correlated with ARPU_SEGMENT and 4 other fields | High correlation |
ORANGE is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
REGULARITY is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
REVENUE is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
TENURE is highly imbalanced (86.4%) | Imbalance |
REGION has 130539 (39.4%) missing values | Missing |
MONTANT has 116364 (35.1%) missing values | Missing |
FREQUENCE_RECH has 116364 (35.1%) missing values | Missing |
REVENUE has 111611 (33.7%) missing values | Missing |
ARPU_SEGMENT has 111611 (33.7%) missing values | Missing |
FREQUENCE has 111611 (33.7%) missing values | Missing |
DATA_VOLUME has 163104 (49.2%) missing values | Missing |
ON_NET has 121131 (36.6%) missing values | Missing |
ORANGE has 137819 (41.6%) missing values | Missing |
TIGO has 198144 (59.8%) missing values | Missing |
ZONE1 has 305097 (92.1%) missing values | Missing |
ZONE2 has 310242 (93.7%) missing values | Missing |
TOP_PACK has 138593 (41.8%) missing values | Missing |
FREQ_TOP_PACK has 138594 (41.8%) missing values | Missing |
DATA_VOLUME is highly skewed (γ1 = 28.11494861) | Skewed |
user_id has unique values | Unique |
DATA_VOLUME has 49051 (14.8%) zeros | Zeros |
ON_NET has 16552 (5.0%) zeros | Zeros |
ORANGE has 9496 (2.9%) zeros | Zeros |
TIGO has 14458 (4.4%) zeros | Zeros |
ZONE1 has 9212 (2.8%) zeros | Zeros |
ZONE2 has 6217 (1.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-28 17:37:06.158209 |
|---|---|
| Analysis finished | 2025-04-28 17:38:03.459185 |
| Duration | 57.3 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
user_id
Text
Unique 
| Distinct | 331267 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
Unique
| Unique | 331267 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 |
|---|---|
| 2nd row | 00000cb4a5d760de88fecb38e2f71b7bec52e834 |
| 3rd row | 00001654a9d9f96303d9969d0a4a851714a4bb57 |
| 4th row | 00001dd6fa45f7ba044bd5d84937be464ce78ac2 |
| 5th row | 000028d9e13a595abe061f9b58f3d76ab907850f |
| Value | Count | Frequency (%) |
| 00001dd6fa45f7ba044bd5d84937be464ce78ac2 | 1 | < 0.1% |
| 276e99744062bb493e52ef26a09e1dfc5446cea6 | 1 | < 0.1% |
| 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 | 1 | < 0.1% |
| 276e50b83a7dc315a086963feb70ba55751399de | 1 | < 0.1% |
| 276e516c979a6afa607b9cbd7eab62216652f44b | 1 | < 0.1% |
| 276e53f7498551fff1234d9e709889c84405ba78 | 1 | < 0.1% |
| 276e56504fe3751cd86222993b4a457a5058e043 | 1 | < 0.1% |
| 276e59a531eba872a57d29ea94b225fc898a8a61 | 1 | < 0.1% |
| 276e5a3c538792af173d795fe57f3331bc3d52a6 | 1 | < 0.1% |
| 276e5b3ea0f146740ea1df8a4c349c14d9e17f69 | 1 | < 0.1% |
| Other values (331257) | 331257 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 946513 | 7.1% |
| 1 | 945963 | 7.1% |
| 2 | 876025 | 6.6% |
| 3 | 812771 | 6.1% |
| 6 | 812255 | 6.1% |
| 5 | 811817 | 6.1% |
| 4 | 810901 | 6.1% |
| 7 | 807111 | 6.1% |
| 8 | 804606 | 6.1% |
| b | 804423 | 6.1% |
| Other values (6) | 4818295 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13250680 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 946513 | 7.1% |
| 1 | 945963 | 7.1% |
| 2 | 876025 | 6.6% |
| 3 | 812771 | 6.1% |
| 6 | 812255 | 6.1% |
| 5 | 811817 | 6.1% |
| 4 | 810901 | 6.1% |
| 7 | 807111 | 6.1% |
| 8 | 804606 | 6.1% |
| b | 804423 | 6.1% |
| Other values (6) | 4818295 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13250680 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 946513 | 7.1% |
| 1 | 945963 | 7.1% |
| 2 | 876025 | 6.6% |
| 3 | 812771 | 6.1% |
| 6 | 812255 | 6.1% |
| 5 | 811817 | 6.1% |
| 4 | 810901 | 6.1% |
| 7 | 807111 | 6.1% |
| 8 | 804606 | 6.1% |
| b | 804423 | 6.1% |
| Other values (6) | 4818295 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13250680 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 946513 | 7.1% |
| 1 | 945963 | 7.1% |
| 2 | 876025 | 6.6% |
| 3 | 812771 | 6.1% |
| 6 | 812255 | 6.1% |
| 5 | 811817 | 6.1% |
| 4 | 810901 | 6.1% |
| 7 | 807111 | 6.1% |
| 8 | 804606 | 6.1% |
| b | 804423 | 6.1% |
| Other values (6) | 4818295 |
REGION
Categorical
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130539 |
| Missing (%) | 39.4% |
| Memory size | 2.5 MiB |
| DAKAR | |
|---|---|
| THIES | |
| SAINT-LOUIS | |
| LOUGA | |
| KAOLACK | |
| Other values (9) |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 6.3245188 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FATICK |
|---|---|
| 2nd row | DAKAR |
| 3rd row | DAKAR |
| 4th row | LOUGA |
| 5th row | LOUGA |
Common Values
| Value | Count | Frequency (%) |
| DAKAR | 78959 | |
| THIES | 27749 | 8.4% |
| SAINT-LOUIS | 18394 | 5.6% |
| LOUGA | 15244 | 4.6% |
| KAOLACK | 14901 | 4.5% |
| DIOURBEL | 10306 | 3.1% |
| TAMBACOUNDA | 8440 | 2.5% |
| KAFFRINE | 6790 | 2.0% |
| KOLDA | 5879 | 1.8% |
| FATICK | 5688 | 1.7% |
| Other values (4) | 8378 | 2.5% |
| (Missing) | 130539 |
Length
| Value | Count | Frequency (%) |
| dakar | 78959 | |
| thies | 27749 | 13.8% |
| saint-louis | 18394 | 9.2% |
| louga | 15244 | 7.6% |
| kaolack | 14901 | 7.4% |
| diourbel | 10306 | 5.1% |
| tambacounda | 8440 | 4.2% |
| kaffrine | 6790 | 3.4% |
| kolda | 5879 | 2.9% |
| fatick | 5688 | 2.8% |
| Other values (4) | 8378 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 273845 | |
| K | 127277 | |
| D | 104230 | 8.2% |
| R | 99382 | 7.8% |
| I | 94462 | 7.4% |
| O | 77296 | 6.1% |
| S | 65024 | 5.1% |
| L | 64724 | 5.1% |
| T | 64676 | 5.1% |
| U | 56516 | 4.5% |
| Other values (10) | 242076 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1269508 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 273845 | |
| K | 127277 | |
| D | 104230 | 8.2% |
| R | 99382 | 7.8% |
| I | 94462 | 7.4% |
| O | 77296 | 6.1% |
| S | 65024 | 5.1% |
| L | 64724 | 5.1% |
| T | 64676 | 5.1% |
| U | 56516 | 4.5% |
| Other values (10) | 242076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1269508 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 273845 | |
| K | 127277 | |
| D | 104230 | 8.2% |
| R | 99382 | 7.8% |
| I | 94462 | 7.4% |
| O | 77296 | 6.1% |
| S | 65024 | 5.1% |
| L | 64724 | 5.1% |
| T | 64676 | 5.1% |
| U | 56516 | 4.5% |
| Other values (10) | 242076 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1269508 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 273845 | |
| K | 127277 | |
| D | 104230 | 8.2% |
| R | 99382 | 7.8% |
| I | 94462 | 7.4% |
| O | 77296 | 6.1% |
| S | 65024 | 5.1% |
| L | 64724 | 5.1% |
| T | 64676 | 5.1% |
| U | 56516 | 4.5% |
| Other values (10) | 242076 |
TENURE
Categorical
Imbalance 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| K > 24 month | |
|---|---|
| I 18-21 month | 7009 |
| H 15-18 month | 3980 |
| G 12-15 month | 2313 |
| J 21-24 month | 1944 |
| Other values (3) | 1814 |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 12.044746 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | K > 24 month |
|---|---|
| 2nd row | I 18-21 month |
| 3rd row | K > 24 month |
| 4th row | K > 24 month |
| 5th row | K > 24 month |
Common Values
| Value | Count | Frequency (%) |
| K > 24 month | 314207 | |
| I 18-21 month | 7009 | 2.1% |
| H 15-18 month | 3980 | 1.2% |
| G 12-15 month | 2313 | 0.7% |
| J 21-24 month | 1944 | 0.6% |
| F 9-12 month | 1391 | 0.4% |
| E 6-9 month | 287 | 0.1% |
| D 3-6 month | 136 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| month | 331267 | |
| k | 314207 | |
| 314207 | ||
| 24 | 314207 | |
| i | 7009 | 0.5% |
| 18-21 | 7009 | 0.5% |
| h | 3980 | 0.3% |
| 15-18 | 3980 | 0.3% |
| g | 2313 | 0.2% |
| 12-15 | 2313 | 0.2% |
| Other values (8) | 7516 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 976741 | ||
| n | 331267 | 8.3% |
| o | 331267 | 8.3% |
| m | 331267 | 8.3% |
| h | 331267 | 8.3% |
| t | 331267 | 8.3% |
| 2 | 328808 | 8.2% |
| 4 | 316151 | 7.9% |
| K | 314207 | 7.9% |
| > | 314207 | 7.9% |
| Other values (14) | 83578 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3990027 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 976741 | ||
| n | 331267 | 8.3% |
| o | 331267 | 8.3% |
| m | 331267 | 8.3% |
| h | 331267 | 8.3% |
| t | 331267 | 8.3% |
| 2 | 328808 | 8.2% |
| 4 | 316151 | 7.9% |
| K | 314207 | 7.9% |
| > | 314207 | 7.9% |
| Other values (14) | 83578 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3990027 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 976741 | ||
| n | 331267 | 8.3% |
| o | 331267 | 8.3% |
| m | 331267 | 8.3% |
| h | 331267 | 8.3% |
| t | 331267 | 8.3% |
| 2 | 328808 | 8.2% |
| 4 | 316151 | 7.9% |
| K | 314207 | 7.9% |
| > | 314207 | 7.9% |
| Other values (14) | 83578 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3990027 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 976741 | ||
| n | 331267 | 8.3% |
| o | 331267 | 8.3% |
| m | 331267 | 8.3% |
| h | 331267 | 8.3% |
| t | 331267 | 8.3% |
| 2 | 328808 | 8.2% |
| 4 | 316151 | 7.9% |
| K | 314207 | 7.9% |
| > | 314207 | 7.9% |
| Other values (14) | 83578 | 2.1% |
MONTANT
Real number (ℝ)
High correlation  Missing 
| Distinct | 2194 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 116364 |
| Missing (%) | 35.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5554.3982 |
| Minimum | 10 |
|---|---|
| Maximum | 290500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 250 |
| Q1 | 1000 |
| median | 3000 |
| Q3 | 7400 |
| 95-th percentile | 18530.6 |
| Maximum | 290500 |
| Range | 290490 |
| Interquartile range (IQR) | 6400 |
Descriptive statistics
| Standard deviation | 7138.7462 |
|---|---|
| Coefficient of variation (CV) | 1.285242 |
| Kurtosis | 46.840777 |
| Mean | 5554.3982 |
| Median Absolute Deviation (MAD) | 2400 |
| Skewness | 4.0943968 |
| Sum | 1.1936568 × 109 |
| Variance | 50961697 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 17342 | 5.2% |
| 1000 | 12758 | 3.9% |
| 1500 | 7417 | 2.2% |
| 2000 | 7137 | 2.2% |
| 200 | 6198 | 1.9% |
| 3000 | 5392 | 1.6% |
| 2500 | 4898 | 1.5% |
| 4000 | 3733 | 1.1% |
| 3500 | 3683 | 1.1% |
| 100 | 3123 | 0.9% |
| Other values (2184) | 143222 | |
| (Missing) | 116364 |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 50 | 42 | < 0.1% |
| 100 | 3123 | |
| 115 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
| 130 | 1 | < 0.1% |
| 150 | 271 | 0.1% |
| 192 | 1 | < 0.1% |
| 199 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 290500 | 1 | |
| 235500 | 1 | |
| 198000 | 1 | |
| 197400 | 1 | |
| 168000 | 1 | |
| 160500 | 1 | |
| 149700 | 1 | |
| 148400 | 1 | |
| 142415 | 1 | |
| 141500 | 1 |
FREQUENCE_RECH
Real number (ℝ)
High correlation  Missing 
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 116364 |
| Missing (%) | 35.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.570923 |
| Minimum | 1 |
|---|---|
| Maximum | 133 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 16 |
| 95-th percentile | 40 |
| Maximum | 133 |
| Range | 132 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 13.297436 |
|---|---|
| Coefficient of variation (CV) | 1.1492114 |
| Kurtosis | 5.2805668 |
| Mean | 11.570923 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.1059724 |
| Sum | 2486626 |
| Variance | 176.82181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 33738 | 10.2% |
| 2 | 21112 | 6.4% |
| 3 | 17007 | 5.1% |
| 4 | 13724 | 4.1% |
| 5 | 11472 | 3.5% |
| 6 | 9887 | 3.0% |
| 7 | 8573 | 2.6% |
| 8 | 7601 | 2.3% |
| 9 | 6859 | 2.1% |
| 10 | 6282 | 1.9% |
| Other values (99) | 78648 | |
| (Missing) | 116364 |
| Value | Count | Frequency (%) |
| 1 | 33738 | |
| 2 | 21112 | |
| 3 | 17007 | |
| 4 | 13724 | |
| 5 | 11472 | 3.5% |
| 6 | 9887 | 3.0% |
| 7 | 8573 | 2.6% |
| 8 | 7601 | 2.3% |
| 9 | 6859 | 2.1% |
| 10 | 6282 | 1.9% |
| Value | Count | Frequency (%) |
| 133 | 1 | |
| 117 | 1 | |
| 115 | 1 | |
| 113 | 1 | |
| 110 | 1 | |
| 108 | 2 | |
| 106 | 2 | |
| 105 | 1 | |
| 103 | 1 | |
| 101 | 2 |
REVENUE
Real number (ℝ)
High correlation  Missing 
| Distinct | 22391 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 111611 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5524.7335 |
| Minimum | 1 |
|---|---|
| Maximum | 221999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 1000 |
| median | 3001 |
| Q3 | 7400 |
| 95-th percentile | 18750.25 |
| Maximum | 221999 |
| Range | 221998 |
| Interquartile range (IQR) | 6400 |
Descriptive statistics
| Standard deviation | 7139.7239 |
|---|---|
| Coefficient of variation (CV) | 1.2923201 |
| Kurtosis | 26.041386 |
| Mean | 5524.7335 |
| Median Absolute Deviation (MAD) | 2499 |
| Skewness | 3.484944 |
| Sum | 1.2135409 × 109 |
| Variance | 50975657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 8957 | 2.7% |
| 1000 | 5539 | 1.7% |
| 1500 | 3177 | 1.0% |
| 200 | 3034 | 0.9% |
| 2000 | 2832 | 0.9% |
| 3000 | 2041 | 0.6% |
| 2500 | 1922 | 0.6% |
| 3500 | 1371 | 0.4% |
| 4000 | 1310 | 0.4% |
| 100 | 1257 | 0.4% |
| Other values (22381) | 188216 | |
| (Missing) | 111611 |
| Value | Count | Frequency (%) |
| 1 | 678 | |
| 2 | 497 | |
| 3 | 26 | < 0.1% |
| 4 | 303 | |
| 5 | 12 | < 0.1% |
| 6 | 160 | < 0.1% |
| 7 | 80 | < 0.1% |
| 8 | 199 | 0.1% |
| 9 | 187 | 0.1% |
| 10 | 421 |
| Value | Count | Frequency (%) |
| 221999 | 1 | |
| 162500 | 1 | |
| 147739 | 1 | |
| 146660 | 1 | |
| 144500 | 1 | |
| 132900 | 1 | |
| 129082 | 1 | |
| 126314 | 1 | |
| 124000 | 1 | |
| 121998 | 1 |
ARPU_SEGMENT
Real number (ℝ)
High correlation  Missing 
| Distinct | 10555 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 111611 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1841.5839 |
| Minimum | 0 |
|---|---|
| Maximum | 74000 |
| Zeros | 678 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 333 |
| median | 1000 |
| Q3 | 2467 |
| 95-th percentile | 6250 |
| Maximum | 74000 |
| Range | 74000 |
| Interquartile range (IQR) | 2134 |
Descriptive statistics
| Standard deviation | 2379.9044 |
|---|---|
| Coefficient of variation (CV) | 1.2923138 |
| Kurtosis | 26.041631 |
| Mean | 1841.5839 |
| Median Absolute Deviation (MAD) | 833 |
| Skewness | 3.484959 |
| Sum | 4.0451496 × 108 |
| Variance | 5663945 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 167 | 10339 | 3.1% |
| 333 | 6660 | 2.0% |
| 500 | 4356 | 1.3% |
| 667 | 3519 | 1.1% |
| 67 | 3479 | 1.1% |
| 1000 | 2864 | 0.9% |
| 833 | 2394 | 0.7% |
| 1167 | 1791 | 0.5% |
| 1333 | 1685 | 0.5% |
| 33 | 1656 | 0.5% |
| Other values (10545) | 180913 | |
| (Missing) | 111611 |
| Value | Count | Frequency (%) |
| 0 | 678 | |
| 1 | 826 | |
| 2 | 252 | 0.1% |
| 3 | 807 | |
| 4 | 437 | |
| 5 | 275 | 0.1% |
| 6 | 187 | 0.1% |
| 7 | 550 | |
| 8 | 117 | < 0.1% |
| 9 | 178 | 0.1% |
| Value | Count | Frequency (%) |
| 74000 | 1 | |
| 54167 | 1 | |
| 49246 | 1 | |
| 48887 | 1 | |
| 48167 | 1 | |
| 44300 | 1 | |
| 43027 | 1 | |
| 42105 | 1 | |
| 41333 | 1 | |
| 40666 | 1 |
FREQUENCE
Real number (ℝ)
High correlation  Missing 
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 111611 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.007571 |
| Minimum | 1 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 9 |
| Q3 | 20 |
| 95-th percentile | 45 |
| Maximum | 91 |
| Range | 90 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.712164 |
|---|---|
| Coefficient of variation (CV) | 1.0503009 |
| Kurtosis | 3.4119982 |
| Mean | 14.007571 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.7755519 |
| Sum | 3076847 |
| Variance | 216.44778 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 24893 | 7.5% |
| 2 | 17668 | 5.3% |
| 3 | 14608 | 4.4% |
| 4 | 12744 | 3.8% |
| 5 | 11037 | 3.3% |
| 6 | 9836 | 3.0% |
| 7 | 8866 | 2.7% |
| 8 | 7945 | 2.4% |
| 9 | 7360 | 2.2% |
| 10 | 6655 | 2.0% |
| Other values (81) | 98044 | |
| (Missing) | 111611 |
| Value | Count | Frequency (%) |
| 1 | 24893 | |
| 2 | 17668 | |
| 3 | 14608 | |
| 4 | 12744 | |
| 5 | 11037 | |
| 6 | 9836 | 3.0% |
| 7 | 8866 | 2.7% |
| 8 | 7945 | 2.4% |
| 9 | 7360 | 2.2% |
| 10 | 6655 | 2.0% |
| Value | Count | Frequency (%) |
| 91 | 14 | < 0.1% |
| 90 | 12 | < 0.1% |
| 89 | 17 | < 0.1% |
| 88 | 30 | |
| 87 | 42 | |
| 86 | 41 | |
| 85 | 38 | |
| 84 | 44 | |
| 83 | 60 | |
| 82 | 67 |
DATA_VOLUME
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 20477 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 163104 |
| Missing (%) | 49.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3378.6694 |
| Minimum | 0 |
|---|---|
| Maximum | 926547 |
| Zeros | 49051 |
| Zeros (%) | 14.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 266 |
| Q3 | 2913 |
| 95-th percentile | 15084.9 |
| Maximum | 926547 |
| Range | 926547 |
| Interquartile range (IQR) | 2913 |
Descriptive statistics
| Standard deviation | 12446.492 |
|---|---|
| Coefficient of variation (CV) | 3.6838441 |
| Kurtosis | 1379.4652 |
| Mean | 3378.6694 |
| Median Absolute Deviation (MAD) | 266 |
| Skewness | 28.114949 |
| Sum | 5.6816718 × 108 |
| Variance | 1.5491515 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 49051 | 14.8% |
| 1 | 6286 | 1.9% |
| 2 | 2079 | 0.6% |
| 3 | 1069 | 0.3% |
| 4 | 870 | 0.3% |
| 1024 | 863 | 0.3% |
| 5 | 720 | 0.2% |
| 1023 | 608 | 0.2% |
| 6 | 573 | 0.2% |
| 7 | 496 | 0.1% |
| Other values (20467) | 105548 | |
| (Missing) | 163104 |
| Value | Count | Frequency (%) |
| 0 | 49051 | |
| 1 | 6286 | 1.9% |
| 2 | 2079 | 0.6% |
| 3 | 1069 | 0.3% |
| 4 | 870 | 0.3% |
| 5 | 720 | 0.2% |
| 6 | 573 | 0.2% |
| 7 | 496 | 0.1% |
| 8 | 468 | 0.1% |
| 9 | 419 | 0.1% |
| Value | Count | Frequency (%) |
| 926547 | 1 | |
| 885642 | 1 | |
| 880424 | 1 | |
| 867127 | 1 | |
| 758167 | 1 | |
| 752018 | 1 | |
| 720309 | 1 | |
| 693842 | 1 | |
| 636022 | 1 | |
| 623124 | 1 |
ON_NET
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 5587 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 121131 |
| Missing (%) | 36.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 278.62195 |
| Minimum | 0 |
|---|---|
| Maximum | 50809 |
| Zeros | 16552 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 27 |
| Q3 | 158 |
| 95-th percentile | 1359 |
| Maximum | 50809 |
| Range | 50809 |
| Interquartile range (IQR) | 153 |
Descriptive statistics
| Standard deviation | 876.93852 |
|---|---|
| Coefficient of variation (CV) | 3.1474136 |
| Kurtosis | 151.48689 |
| Mean | 278.62195 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 8.6456016 |
| Sum | 58548503 |
| Variance | 769021.16 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16552 | 5.0% |
| 1 | 14172 | 4.3% |
| 2 | 8833 | 2.7% |
| 3 | 6486 | 2.0% |
| 7 | 6310 | 1.9% |
| 8 | 6012 | 1.8% |
| 4 | 5868 | 1.8% |
| 5 | 4558 | 1.4% |
| 6 | 4501 | 1.4% |
| 9 | 3013 | 0.9% |
| Other values (5577) | 133831 | |
| (Missing) | 121131 |
| Value | Count | Frequency (%) |
| 0 | 16552 | |
| 1 | 14172 | |
| 2 | 8833 | |
| 3 | 6486 | 2.0% |
| 4 | 5868 | 1.8% |
| 5 | 4558 | 1.4% |
| 6 | 4501 | 1.4% |
| 7 | 6310 | 1.9% |
| 8 | 6012 | 1.8% |
| 9 | 3013 | 0.9% |
| Value | Count | Frequency (%) |
| 50809 | 1 | |
| 30425 | 1 | |
| 25263 | 1 | |
| 25225 | 1 | |
| 24482 | 1 | |
| 24293 | 1 | |
| 24281 | 1 | |
| 23804 | 1 | |
| 23595 | 1 | |
| 21930 | 1 |
ORANGE
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 1967 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 137819 |
| Missing (%) | 41.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.072397 |
| Minimum | 0 |
|---|---|
| Maximum | 6429 |
| Zeros | 9496 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 29 |
| Q3 | 99 |
| 95-th percentile | 396 |
| Maximum | 6429 |
| Range | 6429 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 206.70985 |
|---|---|
| Coefficient of variation (CV) | 2.151605 |
| Kurtosis | 94.783421 |
| Mean | 96.072397 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 7.3046543 |
| Sum | 18585013 |
| Variance | 42728.961 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 10526 | 3.2% |
| 0 | 9496 | 2.9% |
| 2 | 7554 | 2.3% |
| 3 | 5572 | 1.7% |
| 4 | 5158 | 1.6% |
| 8 | 3962 | 1.2% |
| 5 | 3801 | 1.1% |
| 6 | 3380 | 1.0% |
| 10 | 3197 | 1.0% |
| 7 | 3175 | 1.0% |
| Other values (1957) | 137627 | |
| (Missing) | 137819 |
| Value | Count | Frequency (%) |
| 0 | 9496 | |
| 1 | 10526 | |
| 2 | 7554 | |
| 3 | 5572 | |
| 4 | 5158 | |
| 5 | 3801 | 1.1% |
| 6 | 3380 | 1.0% |
| 7 | 3175 | 1.0% |
| 8 | 3962 | 1.2% |
| 9 | 3121 | 0.9% |
| Value | Count | Frequency (%) |
| 6429 | 1 | |
| 6211 | 1 | |
| 6005 | 1 | |
| 5841 | 1 | |
| 5592 | 1 | |
| 5429 | 1 | |
| 5250 | 1 | |
| 5074 | 1 | |
| 4889 | 1 | |
| 4747 | 1 |
TIGO
Real number (ℝ)
Missing  Zeros 
| Distinct | 755 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 198144 |
| Missing (%) | 59.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.101921 |
| Minimum | 0 |
|---|---|
| Maximum | 2899 |
| Zeros | 14458 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 20 |
| 95-th percentile | 95 |
| Maximum | 2899 |
| Range | 2899 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 62.385394 |
|---|---|
| Coefficient of variation (CV) | 2.7004419 |
| Kurtosis | 282.43796 |
| Mean | 23.101921 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 12.004446 |
| Sum | 3075397 |
| Variance | 3891.9374 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 17225 | 5.2% |
| 0 | 14458 | 4.4% |
| 2 | 11121 | 3.4% |
| 3 | 8227 | 2.5% |
| 4 | 6599 | 2.0% |
| 5 | 5351 | 1.6% |
| 6 | 4612 | 1.4% |
| 7 | 3979 | 1.2% |
| 8 | 3821 | 1.2% |
| 9 | 3267 | 1.0% |
| Other values (745) | 54463 | 16.4% |
| (Missing) | 198144 |
| Value | Count | Frequency (%) |
| 0 | 14458 | |
| 1 | 17225 | |
| 2 | 11121 | |
| 3 | 8227 | |
| 4 | 6599 | 2.0% |
| 5 | 5351 | 1.6% |
| 6 | 4612 | 1.4% |
| 7 | 3979 | 1.2% |
| 8 | 3821 | 1.2% |
| 9 | 3267 | 1.0% |
| Value | Count | Frequency (%) |
| 2899 | 1 | |
| 2758 | 1 | |
| 2663 | 1 | |
| 2625 | 1 | |
| 2568 | 1 | |
| 2554 | 1 | |
| 1896 | 1 | |
| 1756 | 1 | |
| 1732 | 1 | |
| 1694 | 1 |
ZONE1
Real number (ℝ)
Missing  Zeros 
| Distinct | 319 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 305097 |
| Missing (%) | 92.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.1801681 |
| Minimum | 0 |
|---|---|
| Maximum | 1867 |
| Zeros | 9212 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 32 |
| Maximum | 1867 |
| Range | 1867 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 40.985521 |
|---|---|
| Coefficient of variation (CV) | 5.010352 |
| Kurtosis | 598.01951 |
| Mean | 8.1801681 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 19.624853 |
| Sum | 214075 |
| Variance | 1679.813 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9212 | 2.8% |
| 1 | 6456 | 1.9% |
| 2 | 2544 | 0.8% |
| 3 | 1397 | 0.4% |
| 4 | 938 | 0.3% |
| 5 | 659 | 0.2% |
| 6 | 511 | 0.2% |
| 7 | 392 | 0.1% |
| 8 | 337 | 0.1% |
| 9 | 315 | 0.1% |
| Other values (309) | 3409 | 1.0% |
| (Missing) | 305097 |
| Value | Count | Frequency (%) |
| 0 | 9212 | |
| 1 | 6456 | |
| 2 | 2544 | 0.8% |
| 3 | 1397 | 0.4% |
| 4 | 938 | 0.3% |
| 5 | 659 | 0.2% |
| 6 | 511 | 0.2% |
| 7 | 392 | 0.1% |
| 8 | 337 | 0.1% |
| 9 | 315 | 0.1% |
| Value | Count | Frequency (%) |
| 1867 | 1 | |
| 1609 | 1 | |
| 1529 | 1 | |
| 1469 | 1 | |
| 1427 | 1 | |
| 1360 | 1 | |
| 1220 | 1 | |
| 963 | 1 | |
| 883 | 1 | |
| 865 | 1 |
ZONE2
Real number (ℝ)
Missing  Zeros 
| Distinct | 236 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 310242 |
| Missing (%) | 93.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.2950297 |
| Minimum | 0 |
|---|---|
| Maximum | 1346 |
| Zeros | 6217 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 27 |
| Maximum | 1346 |
| Range | 1346 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 30.213219 |
|---|---|
| Coefficient of variation (CV) | 4.141617 |
| Kurtosis | 567.06608 |
| Mean | 7.2950297 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 19.146683 |
| Sum | 153378 |
| Variance | 912.83861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6217 | 1.9% |
| 1 | 4132 | 1.2% |
| 2 | 2340 | 0.7% |
| 3 | 1502 | 0.5% |
| 4 | 1140 | 0.3% |
| 5 | 793 | 0.2% |
| 6 | 561 | 0.2% |
| 7 | 497 | 0.2% |
| 8 | 380 | 0.1% |
| 9 | 320 | 0.1% |
| Other values (226) | 3143 | 0.9% |
| (Missing) | 310242 |
| Value | Count | Frequency (%) |
| 0 | 6217 | |
| 1 | 4132 | |
| 2 | 2340 | 0.7% |
| 3 | 1502 | 0.5% |
| 4 | 1140 | 0.3% |
| 5 | 793 | 0.2% |
| 6 | 561 | 0.2% |
| 7 | 497 | 0.2% |
| 8 | 380 | 0.1% |
| 9 | 320 | 0.1% |
| Value | Count | Frequency (%) |
| 1346 | 1 | |
| 1174 | 1 | |
| 1039 | 1 | |
| 1007 | 1 | |
| 932 | 1 | |
| 799 | 1 | |
| 702 | 1 | |
| 660 | 1 | |
| 639 | 1 | |
| 608 | 1 |
MRG
Boolean
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 323.6 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 331267 |
REGULARITY
Real number (ℝ)
High correlation 
| Distinct | 62 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.063873 |
| Minimum | 1 |
|---|---|
| Maximum | 62 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 24 |
| Q3 | 51 |
| 95-th percentile | 62 |
| Maximum | 62 |
| Range | 61 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 22.288201 |
|---|---|
| Coefficient of variation (CV) | 0.79419548 |
| Kurtosis | -1.4869465 |
| Mean | 28.063873 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.24551932 |
| Sum | 9296635 |
| Variance | 496.76391 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 30160 | 9.1% |
| 62 | 25840 | 7.8% |
| 2 | 18352 | 5.5% |
| 3 | 13021 | 3.9% |
| 4 | 10496 | 3.2% |
| 61 | 9792 | 3.0% |
| 5 | 8704 | 2.6% |
| 6 | 7749 | 2.3% |
| 60 | 7328 | 2.2% |
| 7 | 6849 | 2.1% |
| Other values (52) | 192976 |
| Value | Count | Frequency (%) |
| 1 | 30160 | |
| 2 | 18352 | |
| 3 | 13021 | |
| 4 | 10496 | 3.2% |
| 5 | 8704 | 2.6% |
| 6 | 7749 | 2.3% |
| 7 | 6849 | 2.1% |
| 8 | 6301 | 1.9% |
| 9 | 5653 | 1.7% |
| 10 | 5283 | 1.6% |
| Value | Count | Frequency (%) |
| 62 | 25840 | |
| 61 | 9792 | 3.0% |
| 60 | 7328 | 2.2% |
| 59 | 6182 | 1.9% |
| 58 | 5288 | 1.6% |
| 57 | 4814 | 1.5% |
| 56 | 4445 | 1.3% |
| 55 | 4205 | 1.3% |
| 54 | 4067 | 1.2% |
| 53 | 3836 | 1.2% |
TOP_PACK
Text
Missing 
| Distinct | 108 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 138593 |
| Missing (%) | 41.8% |
| Memory size | 2.5 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 42 |
| Mean length | 23.186102 |
| Min length | 2 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | On net 200F=Unlimited _call24H |
|---|---|
| 2nd row | On-net 1000F=10MilF;10d |
| 3rd row | Data:1000F=5GB,7d |
| 4th row | Mixt 250F=Unlimited_call24H |
| 5th row | MIXT:500F= 2500F on net _2500F off net;2d |
| Value | Count | Frequency (%) |
| all-net | 59457 | 12.4% |
| 500f=2000f;5d | 48735 | 10.1% |
| net | 39624 | 8.2% |
| on | 36664 | 7.6% |
| 200f=unlimited | 23422 | 4.9% |
| call24h | 23422 | 4.9% |
| 2500f | 19978 | 4.2% |
| data | 19690 | 4.1% |
| data:490f=1gb,7d | 17794 | 3.7% |
| mixt | 14245 | 3.0% |
| Other values (147) | 177487 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 629738 | 14.1% |
| 287844 | 6.4% | |
| l | 270658 | 6.1% |
| F | 259719 | 5.8% |
| t | 248873 | 5.6% |
| n | 224266 | 5.0% |
| 2 | 209273 | 4.7% |
| e | 180460 | 4.0% |
| a | 171720 | 3.8% |
| 5 | 170184 | 3.8% |
| Other values (61) | 1814624 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4467359 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 629738 | 14.1% |
| 287844 | 6.4% | |
| l | 270658 | 6.1% |
| F | 259719 | 5.8% |
| t | 248873 | 5.6% |
| n | 224266 | 5.0% |
| 2 | 209273 | 4.7% |
| e | 180460 | 4.0% |
| a | 171720 | 3.8% |
| 5 | 170184 | 3.8% |
| Other values (61) | 1814624 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4467359 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 629738 | 14.1% |
| 287844 | 6.4% | |
| l | 270658 | 6.1% |
| F | 259719 | 5.8% |
| t | 248873 | 5.6% |
| n | 224266 | 5.0% |
| 2 | 209273 | 4.7% |
| e | 180460 | 4.0% |
| a | 171720 | 3.8% |
| 5 | 170184 | 3.8% |
| Other values (61) | 1814624 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4467359 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 629738 | 14.1% |
| 287844 | 6.4% | |
| l | 270658 | 6.1% |
| F | 259719 | 5.8% |
| t | 248873 | 5.6% |
| n | 224266 | 5.0% |
| 2 | 209273 | 4.7% |
| e | 180460 | 4.0% |
| a | 171720 | 3.8% |
| 5 | 170184 | 3.8% |
| Other values (61) | 1814624 |
FREQ_TOP_PACK
Real number (ℝ)
High correlation  Missing 
| Distinct | 165 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 138594 |
| Missing (%) | 41.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.2994504 |
| Minimum | 1 |
|---|---|
| Maximum | 592 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 12 |
| 95-th percentile | 33 |
| Maximum | 592 |
| Range | 591 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.301065 |
|---|---|
| Coefficient of variation (CV) | 1.3227733 |
| Kurtosis | 49.162582 |
| Mean | 9.2994504 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.9116439 |
| Sum | 1791753 |
| Variance | 151.3162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 38577 | 11.6% |
| 2 | 23737 | 7.2% |
| 3 | 18152 | 5.5% |
| 4 | 13151 | 4.0% |
| 5 | 10468 | 3.2% |
| 6 | 8813 | 2.7% |
| 7 | 7637 | 2.3% |
| 8 | 6734 | 2.0% |
| 9 | 5916 | 1.8% |
| 10 | 5350 | 1.6% |
| Other values (155) | 54138 | 16.3% |
| (Missing) | 138594 |
| Value | Count | Frequency (%) |
| 1 | 38577 | |
| 2 | 23737 | |
| 3 | 18152 | |
| 4 | 13151 | 4.0% |
| 5 | 10468 | 3.2% |
| 6 | 8813 | 2.7% |
| 7 | 7637 | 2.3% |
| 8 | 6734 | 2.0% |
| 9 | 5916 | 1.8% |
| 10 | 5350 | 1.6% |
| Value | Count | Frequency (%) |
| 592 | 1 | |
| 316 | 1 | |
| 308 | 1 | |
| 304 | 1 | |
| 294 | 1 | |
| 262 | 1 | |
| 217 | 1 | |
| 215 | 1 | |
| 214 | 1 | |
| 206 | 1 |
CHURN
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.5 MiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 269268 | |
| 1.0 | 61998 | 18.7% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 269268 | |
| 1.0 | 61998 | 18.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 600534 | |
| . | 331266 | |
| 1 | 61998 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 993798 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 600534 | |
| . | 331266 | |
| 1 | 61998 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 993798 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 600534 | |
| . | 331266 | |
| 1 | 61998 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 993798 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 600534 | |
| . | 331266 | |
| 1 | 61998 | 6.2% |
Interactions
Correlations
| ARPU_SEGMENT | CHURN | DATA_VOLUME | FREQUENCE | FREQUENCE_RECH | FREQ_TOP_PACK | MONTANT | ON_NET | ORANGE | REGION | REGULARITY | REVENUE | TENURE | TIGO | ZONE1 | ZONE2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARPU_SEGMENT | 1.000 | 0.036 | 0.390 | 0.881 | 0.880 | 0.816 | 0.987 | 0.522 | 0.678 | 0.028 | 0.716 | 1.000 | 0.014 | 0.451 | 0.216 | 0.311 |
| CHURN | 0.036 | 1.000 | 0.004 | 0.147 | 0.108 | 0.017 | 0.023 | 0.015 | 0.024 | 0.032 | 0.555 | 0.036 | 0.051 | 0.007 | 0.000 | 0.026 |
| DATA_VOLUME | 0.390 | 0.004 | 1.000 | 0.331 | 0.296 | 0.229 | 0.380 | -0.095 | -0.019 | 0.015 | 0.306 | 0.390 | 0.030 | -0.010 | -0.024 | -0.002 |
| FREQUENCE | 0.881 | 0.147 | 0.331 | 1.000 | 0.952 | 0.867 | 0.871 | 0.441 | 0.530 | 0.054 | 0.691 | 0.881 | 0.004 | 0.336 | 0.079 | 0.188 |
| FREQUENCE_RECH | 0.880 | 0.108 | 0.296 | 0.952 | 1.000 | 0.895 | 0.887 | 0.478 | 0.563 | 0.047 | 0.678 | 0.880 | 0.000 | 0.364 | 0.086 | 0.181 |
| FREQ_TOP_PACK | 0.816 | 0.017 | 0.229 | 0.867 | 0.895 | 1.000 | 0.811 | 0.439 | 0.538 | 0.016 | 0.596 | 0.817 | 0.001 | 0.351 | 0.093 | 0.062 |
| MONTANT | 0.987 | 0.023 | 0.380 | 0.871 | 0.887 | 0.811 | 1.000 | 0.512 | 0.669 | 0.021 | 0.708 | 0.987 | 0.011 | 0.448 | 0.213 | 0.310 |
| ON_NET | 0.522 | 0.015 | -0.095 | 0.441 | 0.478 | 0.439 | 0.512 | 1.000 | 0.550 | 0.009 | 0.525 | 0.522 | 0.000 | 0.370 | 0.064 | -0.022 |
| ORANGE | 0.678 | 0.024 | -0.019 | 0.530 | 0.563 | 0.538 | 0.669 | 0.550 | 1.000 | 0.016 | 0.456 | 0.678 | 0.014 | 0.470 | 0.120 | 0.052 |
| REGION | 0.028 | 0.032 | 0.015 | 0.054 | 0.047 | 0.016 | 0.021 | 0.009 | 0.016 | 1.000 | 0.037 | 0.028 | 0.024 | 0.012 | 0.000 | 0.000 |
| REGULARITY | 0.716 | 0.555 | 0.306 | 0.691 | 0.678 | 0.596 | 0.708 | 0.525 | 0.456 | 0.037 | 1.000 | 0.716 | 0.017 | 0.323 | 0.047 | 0.044 |
| REVENUE | 1.000 | 0.036 | 0.390 | 0.881 | 0.880 | 0.817 | 0.987 | 0.522 | 0.678 | 0.028 | 0.716 | 1.000 | 0.014 | 0.451 | 0.215 | 0.311 |
| TENURE | 0.014 | 0.051 | 0.030 | 0.004 | 0.000 | 0.001 | 0.011 | 0.000 | 0.014 | 0.024 | 0.017 | 0.014 | 1.000 | 0.000 | 0.022 | 0.072 |
| TIGO | 0.451 | 0.007 | -0.010 | 0.336 | 0.364 | 0.351 | 0.448 | 0.370 | 0.470 | 0.012 | 0.323 | 0.451 | 0.000 | 1.000 | 0.075 | 0.018 |
| ZONE1 | 0.216 | 0.000 | -0.024 | 0.079 | 0.086 | 0.093 | 0.213 | 0.064 | 0.120 | 0.000 | 0.047 | 0.215 | 0.022 | 0.075 | 1.000 | 0.107 |
| ZONE2 | 0.311 | 0.026 | -0.002 | 0.188 | 0.181 | 0.062 | 0.310 | -0.022 | 0.052 | 0.000 | 0.044 | 0.311 | 0.072 | 0.018 | 0.107 | 1.000 |
Missing values
Sample
| user_id | REGION | TENURE | MONTANT | FREQUENCE_RECH | REVENUE | ARPU_SEGMENT | FREQUENCE | DATA_VOLUME | ON_NET | ORANGE | TIGO | ZONE1 | ZONE2 | MRG | REGULARITY | TOP_PACK | FREQ_TOP_PACK | CHURN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 | FATICK | K > 24 month | 4250.0 | 15.0 | 4251.0 | 1417.0 | 17.0 | 4.0 | 388.0 | 46.0 | 1.0 | 1.0 | 2.0 | NO | 54 | On net 200F=Unlimited _call24H | 8.0 | 0.0 |
| 1 | 00000cb4a5d760de88fecb38e2f71b7bec52e834 | NaN | I 18-21 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 4 | NaN | NaN | 1.0 |
| 2 | 00001654a9d9f96303d9969d0a4a851714a4bb57 | NaN | K > 24 month | 3600.0 | 2.0 | 1020.0 | 340.0 | 2.0 | NaN | 90.0 | 46.0 | 7.0 | NaN | NaN | NO | 17 | On-net 1000F=10MilF;10d | 1.0 | 0.0 |
| 3 | 00001dd6fa45f7ba044bd5d84937be464ce78ac2 | DAKAR | K > 24 month | 13500.0 | 15.0 | 13502.0 | 4501.0 | 18.0 | 43804.0 | 41.0 | 102.0 | 2.0 | NaN | NaN | NO | 62 | Data:1000F=5GB,7d | 11.0 | 0.0 |
| 4 | 000028d9e13a595abe061f9b58f3d76ab907850f | DAKAR | K > 24 month | 1000.0 | 1.0 | 985.0 | 328.0 | 1.0 | NaN | 39.0 | 24.0 | NaN | NaN | NaN | NO | 11 | Mixt 250F=Unlimited_call24H | 2.0 | 0.0 |
| 5 | 0000296564272665ccd2925d377e124f3306b01e | LOUGA | K > 24 month | 8500.0 | 17.0 | 9000.0 | 3000.0 | 18.0 | NaN | 252.0 | 70.0 | 91.0 | NaN | NaN | NO | 62 | MIXT:500F= 2500F on net _2500F off net;2d | 18.0 | 0.0 |
| 6 | 00002b0ed56e2c199ec8c3021327229afa70f063 | LOUGA | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 2 | NaN | NaN | 0.0 |
| 7 | 0000313946b6849745963442c6e572d47cd24ced | DAKAR | K > 24 month | 7000.0 | 16.0 | 7229.0 | 2410.0 | 22.0 | 1601.0 | 77.0 | 29.0 | 100.0 | NaN | NaN | NO | 55 | All-net 500F=2000F;5d | 8.0 | 0.0 |
| 8 | 0000398021ccd3a488fa1a63dee3b2f0d471f9fd | DAKAR | K > 24 month | 1500.0 | 3.0 | 1502.0 | 501.0 | 12.0 | NaN | 2.0 | 53.0 | 2.0 | NaN | NaN | NO | 31 | NaN | NaN | 0.0 |
| 9 | 00003d165737109921ebd21f883cb8cff028b626 | TAMBACOUNDA | K > 24 month | 4000.0 | 8.0 | 4000.0 | 1333.0 | 8.0 | NaN | 1620.0 | 9.0 | NaN | NaN | NaN | NO | 45 | On-net 500F_FNF;3d | 8.0 | 0.0 |
| user_id | REGION | TENURE | MONTANT | FREQUENCE_RECH | REVENUE | ARPU_SEGMENT | FREQUENCE | DATA_VOLUME | ON_NET | ORANGE | TIGO | ZONE1 | ZONE2 | MRG | REGULARITY | TOP_PACK | FREQ_TOP_PACK | CHURN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 331257 | 276e5b3ea0f146740ea1df8a4c349c14d9e17f69 | NaN | K > 24 month | 500.0 | 1.0 | 500.0 | 167.0 | 1.0 | NaN | 8.0 | NaN | NaN | NaN | NaN | NO | 12 | On-net 500=4000,10d | 1.0 | 0.0 |
| 331258 | 276e6110fc32f41b35088354be98dafabd50e368 | LOUGA | K > 24 month | 3000.0 | 3.0 | 2987.0 | 996.0 | 7.0 | NaN | 16.0 | 15.0 | 2.0 | 3.0 | NaN | NO | 21 | NaN | NaN | 0.0 |
| 331259 | 276e68d1807217b3e7876e6fcd0cc540a8972bdd | DAKAR | K > 24 month | 3000.0 | 4.0 | 2999.0 | 1000.0 | 7.0 | 6763.0 | 4.0 | 38.0 | 1.0 | NaN | NaN | NO | 54 | Data:1000F=2GB,30d | 3.0 | 0.0 |
| 331260 | 276e69675e2cf6b3999695469721f854a1727361 | DAKAR | K > 24 month | 3500.0 | 6.0 | 3499.0 | 1166.0 | 8.0 | NaN | 51.0 | 74.0 | NaN | NaN | NaN | NO | 33 | All-net 500F=2000F;5d | 3.0 | 0.0 |
| 331261 | 276e72a09dd82f06403fa653a92a3da7d5da767b | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 1 | NaN | NaN | 1.0 |
| 331262 | 276e80e1438ceef01c586fbced93ad064e2ea4fb | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 1 | NaN | NaN | 0.0 |
| 331263 | 276e81372ba8b1fa4092dd0f986af500e8724667 | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 1 | NaN | NaN | 0.0 |
| 331264 | 276e862452e4f2c28a5a56b48ad17abe41ed553e | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 10 | NaN | NaN | 1.0 |
| 331265 | 276e89e10fb5d85cd567e8d50a3f2bbdfbd10eb9 | DAKAR | K > 24 month | 500.0 | 1.0 | 502.0 | 167.0 | 2.0 | NaN | 46.0 | 18.0 | NaN | NaN | NaN | NO | 6 | All-net 500F=2000F;5d | 1.0 | 0.0 |
| 331266 | 276e99744062bb493e52ef26a09e1dfc5446cea6 | KAFFRINE | K > 24 month | 3500.0 | 7.0 | 3500.0 | 1167.0 | 7.0 | NaN | 58.0 | 88.0 | 2.0 | NaN | NaN | NO | 48 | MI | NaN | NaN |